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GERANIOL SYNTHASE, METHODS OF PRODUCTION 

AND USES THEREOF 

STATEMENT OF GOVERNMENTAL SUPPORT 
5 This invention was made with Governmental Support Work Order Award No. IS- 

3322-02C, USD A NRICGP 2001-35318-10006, N awarded by the U. S. Department of 
Agriculture. The U. S. Government has certain rights in this invention. 

FIELD OF THE INVENTION 
10 The present invention relates to Geraniol synthase (GES), a key enzyme in the 

production of the monoterpene aroma compound geraniol. More specifically, the present 
invention relates to nucleic acid sequences coding for GES from sweet basil plant species 
as well as to vectors containing the sequences, to host cells containing the sequences and 
to methods of producing recombinant GES, its products, and uses thereof. 

15 

BACKGROUND OF THE INVENTION 
Flavors and aromas, many of which originate from plants, have always had an 
important role in human culture. Ancient cultures cultivated and prized plants for their 
nutritional value as well as for their flavor, aroma and medicinal properties. However, 

20 the development of commercial large-scale agriculture in western civilizations resulted in 
emphasizing commercial/marketing interests in plant production, such as long shelf life, 
physical appearance and yield. The content of secondary metabolites, i.e. metabolites 
which do not have a defined metabolic role and presence of which is restricted to specific 
tissues, were often overlooked, although they significantly affect the nutritional value 

25 and aroma. Nowadays, there is growing awareness to healthy and flavorful plant 
products. Physically appealing but flavorless and aroma-less fruit is perceived as 
"synthetic", while scented fruit is perceived as more "natural". There is increasing public 
interest to "return" the natural flavor and aroma to fruits, emphasizing the importance of 
elucidating the relevant biosynthetic pathways, enzymes, genes and regulatory 

30 mechanisms involved. Cosmetic and food industries also seek natural aroma and flavors. 

The monoterpene geraniol, which is emitted from flowers, notably roses (Rao et 
al 2000) and herbs (Mockute et al 1999) of many species, has an important role in their 

l 
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overall flavor and aroma. The mixture of geraniol's oxidation products, geranial and 
neral, also called citral, imparts a "lemon" flavor, and lemongrass (Singh 
Sangwan et al 1993), ginger (Miyazawa et al 1998), and some varieties of sweet basil 
(Grayer et al 1996) such as "Sweet Dani" are particularly rich in citral (Morales et al 
5 1997). However, much is still lacking in the study of the physiological, biochemical 
and genetic regulation of the production of aroma compounds within the plants. 

Terpenoids are found in all plant species and have diverse physiological roles 
such as phytoalexins, pest deterrents and toxins, growth regulators, pollinator attractants, 
photosynthetic pigments and electron acceptors (Gershenzon and Croteau, 1993; 

10 Chappell, 1995; McGarvey and Croteau, 1995). For example, US Patent No. 

6,258,602 discloses the isolation and bacterial expression of a sesquiterpene synthase 
cDNA clone from peppermint that produces the aphid alarm pheromone E-betafarnesene. 

Monoterpenes are also one of the important groups of secondary metabolites 
involved in herb, fruit and flower aromas. While the backbones of the biosynthetic 

15 pathways leading to production of monoterpenes are ubiquitous to all plant species, the 
composition of terpenes often differs dramatically between species or even varieties 
leading to the diversity of aroma and flavors among herbs and fruits. This diversity 
seems to stem mainly from the specific composition and expression of the key-enzymes 
in the biosynthetic pathway, the terpene synthases, and additional downstream 

20 modification enzymes. Homology analysis revealed that while sequence conservation 
was not high among terpene synthases of different plant species, discrete conserved 
domains were present suggesting significant structural and functional similarity (Back 
and Chappell, 1995; Steele et al., 1998). These conserved domains have been the basis 
for isolation of a number of terpene synthases encoding genes from a variety of plant 

25 species using degenerate-primer based RT-PCR (Bohlmann et ah, 1998a, 1998b, 1999; 
Steele et al., 1998). 

Geraniol is likely to be synthesized from geranyl diphosphate (GDP), the 
universal precursor of all monoterpenes. Two types of enzymatic reactions have been 
hypothesized to lead to geraniol synthesis from GDP, either a phosphate-based or a 

30 monoterpene-based catalysis. However, in the absence of purified and characterized 

geraniol synthase, the question of whether GES employs a similar mechanism to the one 
used by other monoterpene synthases remained unsolved. This state of research limits the 
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ability of the agricultural, food and cosmetic industries to use natural herb aromas, which 
are highly desirable in products of these industries. 

Thus, there is a recognized need for, and it would be highly advantageous to have 
specific compounds involved in the regulation of herbal aroma pathways, and more 
5 advantageous to have isolated polynucleotides and enzymes capable of producing said 
aromas. 

SUMMARY OF THE INVENTION 
The present invention relates to a key enzyme in the production of the monoterpene 
10 geraniol, an aroma compound found mainly in flower and herb species. 

The present invention provides a novel type of the enzyme geraniol synthase 
(GES), which is involved in the terpene biosynthetic pathway converting geranyl 
diphosphate (ODP) to geraniol. According to one aspect, the monoterpene synthase is 
GES. The present invention also provides polynucleotide sequences encoding the GES, 
15 including recombinant DNA molecules. The present invention further provides vectors 
and host cells, including vectors comprising the polynucleotides of the present invention, 
host cells engineered to contain the polynucleotides of the present invention and host 
cells engineered to express the polynucleotides of the present invention. Thus, the 
present invention provides methods for (i) expressing the recombinant monoterpene 
20 synthase, specifically GES, to facilitate the production, isolation and purification of 

significant quantities of recombinant GES, or of its primary and secondary products for 
subsequent use; (ii) expressing or enhancing the expression of a monoterpene synthase, 
specifically GES, in microorganisms or in plants; and (iii) regulating the expression of a 
monoterpene synthase, specifically GES, in an environment where such regulation of 
25 expression is desired for the production of the enzyme and for producing the enzyme 
products and derivatives thereof. 

The present invention further provides polynucleotide sequences encoding 
monoterpene synthase, specifically GES, for use in a variety of methods and techniques 
known to those skilled in the art of molecular biology, including, but not limited to the 
30 use as hybridization probes, as oligomers for PCR, for chromosome and gene mapping, 
and the like. 
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The present invention also provides methods for using monoterpene synthase 

« 

enzymatic products, specifically geraniol, in the agricultural, cosmetic and food 
industries. 

According to one aspect, the present invention provides monoterpene synthase 
characterized in that it converts geranyl diphosphate (GDP) to geraniol, and 
polynucleotides encoding same. 

According to one embodiment, the present invention provides an isolated 
polynucleotide comprising a genomic, complementary or composite polynucleotide 
sequence encoding a GES, said GES being capable of converting GDP to geraniol. 

According to one embodiment, the present invention provides an isolated 
polynucleotide comprising a nucleic acid sequence selected from the group consisting of: 

(a) the nucleic acid sequence of SEQ ID NO:l (Fig. 1); 

(b) the complement of SEQ ID NO:l; 

(c) a nucleic acid sequence which is at least 90% homologous to SEQ ID NO:l; and 

(d) a nucleic acid sequence capable of hybridizing either (a) or (b). 
According to one another embodiment, the present invention provides a 

polynucleotide comprising a nucleic acid sequence encoding a GES comprising the 
amino acid sequence of SEQ ID NO:2 (Fig. 2). 

According to one embodiment, the present invention provides an isolated 
polynucleotide comprising a nucleic acid sequence encoding a GES which is at least 
60%, preferably at least 70%, more preferably at least 80% or more, most preferably at 
least 90% or 100% homologous (similar + identical amino acids) to the amino acid 
sequence set forth in SEQ ID NO:2. 

According to yet another embodiment, the present invention provides an isolated 
polynucleotide comprising a nucleic acid sequence encoding a GES comprising the 
amino acid sequence of SEQ ID NO:2, or fragments, derivatives and analogs thereof. 
The present invention also provides an isolated polynucleotide comprising a nucleic acid 
sequence which hybridizes to the polynucleotide encoding GES comprising the amino 
acid sequence of SEQ ID NO:2 or fragments, derivatives and analogs thereof. The 
present invention further provides an isolated polynucleotide comprising a nucleic acid 
sequence which is complementary to the polynucleotide encoding a GES comprising the 
amino acid sequence of SEQ ID NO:2 and fragments, derivatives and analogs thereof 



WO 2005/060553 



PCTYUS2004/040321 



According to one another embodiment, the present invention provides a 
polypeptide having GES activity, said activity being characterized by converting geranyl 

diphosphate (GDP) to gfcraniol. 

According to one embodiment, the present invention provides a GES having the 

amino acid sequence of SEQ ED NO: 2. 

According to another embodiment, the present invention provides a polypeptide 
which is at least 60%, preferably at least 70%, more preferably at least 80% or more, 
most preferably at least 90% or 100% homologous (similar + identical amino acids) to 
the amino acid sequence set forth in SEQ ID NO:2, said polypeptide is a GES. 

According to one embodiment, the GES of the present invention is originated from 
herb species, specifically from oranges basil cultivar "Sweet Dani". 

According to another aspect, the present invention provides methods for the 
production, isolation and purification of the GES, as well as of the products of its 
enzymatic activity. 

According to one embodiment, the present invention provides an expression vector 
comprising a nucleic acid sequence encoding a GES. 

According to another embodiment, the present invention provides an expression 
vector comprising a nucleic acid sequence selected from the group consisting of: 

(a) the nucleic acid sequence of SEQ ID NO:l; 

(b) the complement of SEQ ID NO: 1; 

(c) a nucleic acid sequence which is at least 90% homologous to SEQ ID NO:l; 
and 

(d) a nucleic acid sequence capable of hybridizing either (a) or (b). 

According to yet another embodiment, the present invention provides an expression 
vector comprising a polynucleotide sequence encoding a GES having SEQ ID NO:2. 

According to further embodiment, the present invention provides an expression 
vector comprising a polynucleotide sequence encoding a GES which is at least 60%, 
preferably 70%, more preferably 80% or more, most preferably at least 90% or 100% 
homologous to SEQ ID NO:2. 

According to one another embodiment the present invention provides a host ceD 
containing the expression vector of the present invention. The present invention further 
provides a method for producing recombinant GES, the method comprising a) culturing 
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the host cell containing the expression vector comprising at least a fragment of the 
polynucleotide sequence encoding GES under conditions suitable for the expression of 
the enzyme; and b) recovering the enzyme from the host cell culture. According to a 
further embodiment the present invention provides a method for producing significant 

5 amounts of geraniol, the method comprising a) culturing the host cell containing an 
expression vector comprising at least a fragment of the polynucleotide sequence 
encoding GES under conditions suitable for the expression and activity of the enzyme; 
and b) recovering geraniol from the host cell culture. 

The geraniol produced within a host cell according to the present invention can 

10 serve as a substrate for producing additional compounds by enzymes present in the host 
cell active downstream to GES in the terpene biosynthesis pathway. Such compounds are 
designated herein as "geraniol metabolites". 

According to one embodiment the present invention provides a method for 
producing significant amounts of geraniol metabolites downstream in the terpene 

15 biosynthesis pathway the method comprising a) culturing the host cell containing an 
expression vector comprising at least a fragment of the polynucleotide sequence 
encoding GES under conditions suitable for the expression and activity of the GES; and 
b) recovering geraniol metabolites from the host cell culture. 

According to one preferred embodiment the geraniol metabolites are geranial and 

20 neral. 

Prokaryotic as weil as eukaryotic expression systems may be utilized for the 
production of GES and its product geraniol, and geraniol metabolites downstream in the 
terpene biosynthesis pathway. Both systems comprise the necessary elements for 
posttranslational modification enabling the proper activity of the enzyme, as well as the 
25 necessary substrates for the synthesis of geraniol and the enzymes for the synthesis of 
downstream geraniol metabolites. 

According to yet another aspect the present invention provides a prokaryotic 
organism in which significant amounts of geraniol are synthesized. 

According to one embodiment, the present invention provides a prokaryotic 
30 organism comprising a polynucleotide sequence encoding a GES stably integrated into 
its genome. 
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According to one embodiment, the present invention provides a prokaryotic 
organism comprising a nucleic acid sequence selected from the group consisting of SEQ 
ID NO:l, the complement of SEQ ID NO:l, a nucleic acid sequence which is at le£st 
90% homologous to SEQ ID NO:l, a nucleic acid sequence capable of hybridizing to 
5 SEQ ID NO:l and a nucleic acid sequence capable of hybridizing to the complement of 
SEQ ID NO:l, stably integrated into its genome. According to another embodiment, the 
present invention provides a prokaryotic 

organism comprising a polynucleotide sequence encoding a GES having the amino acid 

sequence of SEQ ID NO:2 stably integrated into its genome. 
10 According to yet another embodiment the present invention provides a prokaryotic 

organism comprising a polynucleotide sequence encoding a GES comprising amino acid 

sequence which is at least 60%, preferably at least 70%, more preferably at least 80% or 

more, most preferably at least 90% or 100% homologous to SEQ ID NO: 2 stably 

integrated into its genome. 
15 According to yet further embodiment, the present invention provides a prokaryotic 

organism comprising a polynucleotide encoding GES stably integrated into its genome, 

said prokaryotic organism produces geraniol. 

According to one preferred embodiment, the prokaryotic organism is selected from 

the group consisting various bacteria; preferably, the prokaryotic organism is E. colt 
20 According to yet another aspect, the present invention provides geraniol and 

geraniol metabolites obtained by the methods of the present invention for industrial uses. 
According to one embodiment, the present invention provides geraniol and geraniol 

metabolites obtained by the methods of the present invention for use in a product 

selected from the group consisting of agricultural, cosmetic, and food products. 
25 According to another embodiment, the geraniol metabolites are geranial and neral. 

The present invention is explained in greater detail in the description, figures and claims 

that follows. 

BRIEF DESCRIPTION OF THE FIGURES 
30 FIG. 1 shows the nucleotide sequence of GES, SEQ ID NO: 1 . 
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FIG. 2 shows the relatedness of GES to other terpene synthases. Amino acid sequence of 
basil GES (SEQ ID NO:2) is compared with two most similar terpene synthases for 
which a function is known, 1,8, cineol synthase from Salvia officinalis and 4S-limonene 
synthase from Mentlia spicata. Residues identical in at least two of the proteins at a 
given position are shaded. lines above the sequence indicate the tryptic peptides of GES 
identified by ESI-MS/MS. The RRX8W motif (which is absent in GES) is double- 
underlined. Serine 35 and methionine 44 in GES are underlined and in bold. 

FIG. 3 shows the position of Geraniol synthase on the phylogenetic map of terpene 
synthases. Alignment was performed using Clustalx and phylogenetic tree was drawn 
using Tree View. 

FIG. 4 shows identification of monoterpenes extracted from leaves and glands of sweet 
basil cultivar "Sweet Dani" and produced by a crude protein extract from the glands by 
GC-MS. (Fig. 4 A) Extraction from young leaves. (Fig. 4B) Extraction from glands. (Fig. 
4C) Extraction of product after an enzyme assay with crude protein extract incubated 
with GDP. Labeled peaks are: 1: Internal standard included for quantification, 2: P- 
caryophyUene, 3:methyl chavicol, 4: neral, 5: GermacreneD, 6:geranial, 7: a-farnesene, 
8: nerol, 9: geraniol. 

Unlabeled peaks in (C) are not terpenes, and are found in the protein crude extract 
regardless of whether GDP is included in the assay. 

FIG. 5 shows SDS-PAGE analysis of purified basil GES from the leaf glands and from 
E. coli expression system. Lane 1, purified GES after Superose-12 size-exclusion 
chromatography. Lane 2, MonoQ-purified truncated GES (starting from Met44) 
produced in E. coll Lane 3, MonoQ-purified truncated GES (starting from Ser35) 
produced in E. colt Lanes marked "M" contain molecular weight markers. Gels were 
stained with Coomassie Blue. 

FIG. 6 shows analysis of the products of the reaction catalyzed by purified GES with 
GDP. (Fig. 6A) Gas chromatographic separation of authentic standards of neral, geranial, 
nerol, and geraniol. (Fig. 6B) SPME-Gas chromatogram of the reaction solution 
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following catalysis by gland-purified basil GES. Only a single peak was observed, 
identified as geraniol by MS. (Fig. 6C) SPME-Gas chromatogram of the reaction 
solution following catalysis by basil GES purified from the E. coli expression system. 
Only a single peak was observed, identified as geraniol by MS. 

FIG. 7 compares mass spectra of geraniol produced in enzymatic reactions with purified 
GES in buffers containing either non-isotopic water (H 2 16 0) or ls O-labeled water (H 2 
ls O). (Fig. 7A) Mass spectra of a geraniol standard. (Fig. 7B) Mass spectra of geraniol 
from a reaction including non-isotopic water (H 2 16 0). (Fig. 7C) Mass spectra of geraniol 
from a reaction containing ls O-labeled water (H 2 ls O). Arrows indicate the mass of 
fragments containing oxygen, 154/156 (M+), 139/141 (M^CEb), and 111/113 (M*- 
C 3 H 7 ). 

FIG. 8 describes the Reaction mechanism of GES. Geraniol is not generated by 
phosphatase activity from GDP, but is formed by the addition of a hydroxyl group to a 
carbocation intermediate. 

DETAILED DESCRIPTION OF THE INVENTION 
The present invention relates to novel types of monoterpene synthases, specifically 
to geraniol synthase. 

The present invention relates to a key enzyme in the production of the monoterpene 
geraniol, an aroma compound found mainly in flower and herb species. 

The present invention discloses the isolation and characterization of a key 
gene/enzyme in flower and herb aroma formation: geraniol synthase (GES). 
Recombinant enzyme activity in-vitro shows a single monoterpene product identified as 
geraniol. 

Definitions 

As used herein, the terms "amino acid" and "amino acids" refer to all naturally 
occurring L-a-amino acids or their residues. The amino acids are identified by either the 
single-letter or three-letter designations. 
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As used herein, the term "nucleotide" means a monomeric unit of DNA or RNA 
containing a sugar moiety (pentose), a! phosphate and a nitrogenous heterocyclic base. 
The base is linked to the sugar moiety via the glycosidic carbon (1' carbon of pentose) 
and that combination of base and sugar is called a nucleoside. The base characterizes the 
nucleotide with the four bases of DNA being adenine ("A"), guanine ("G"), cytosine 
("C"), thymine ("T") and inosine ("T). The four RNA bases are A,G,C and uracil ("U"). 
The nucleotide sequences described herein comprise a line array of nucleotides 
connected by phosphodiester bonds between the 3' and 5' carbons of adjacent pentoses. 

The term "percent identity" means the percentage of amino acids or nucleotides 
that occupy the same relative position when two amino acid sequences, or two nucleic 
acid sequences are aligned side by side. 

The term "percent similarity" is a statistical measure of the degree of relatedness of 
two compared protein sequences. The percent similarity is calculated by a computer 
program that assigns a numerical value to each compared pair of amino acids based on 
chemical similarity (e.g., whether the compared amino acids are acidic, basic, 
hydrophobic, aromatic, etc.) and/or evolutionary distance as measured by the minimum 
number of base pair changes that would be required to convert a codon encoding one 
member of a pair of compared amino acids to a codon encoding the other member of the 
pair. Calculations are made after a best fit alignment of the two sequences has been made 
empirically by iterative comparison of all possible alignments. (Henikoff, S. and 
Henikoff, J. G., Proc, Nat'l. Acad. Sci. USA 89:10915-10919, 1992). 

"Oligonucleotide" refers to short length single or double stranded sequences of 
deoxyribonucleotides linked via phosphodiester bonds. The oligonucleotides are 
chemically synthesized by known methods and purified on polyacrylamide gels. 

The term "Monoterpene synthase" is used herein to mean an enzyme capable of 
catalyzing the production of monoterpene from GDP. "Geraniol synthase" (GES) is used 
herein to mean an enzyme that catalyzes the production of the monoterpene geraniol 
from GDP. 

The terms "alteration", "amino acid sequence alteration", "variant" and "amino 
acid sequence variant" refer to GES molecules with some differences in their amino acid 
sequences as compared to GES from specific flowers and herbs, especially GES of the 
basil cultivar "Sweet Dani" having the amino acid sequence set forth in SEQ ID N0:2. 

10 
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Ordinarily, the variants will possess at least about 70% homology, preferably at least 
about 80% homology with the above defined GES. The amino acid sequence variants of 
GES falling within this invention possess substitutions, deletions, and/or insertions at 
certain positions. Sequence variants of GES may be used to attain desired enhanced 
enzymatic activity or altered substrate utilization or product distribution. Substitutional 
GES variants are those that have at least one amino acid residue 
in the GES sequence set forth in SEQ ID N0:2 removed and a different amino acid 
inserted in its place at the same position. The substitutions may be single, where only one 
amino acid in the molecule has been substituted, or they may be multiple, where two or 
more amino acids have been substituted in the same molecule. Substantial changes in the 
activity of the GES molecules of the present invention may be obtained by substituting 
an amino acid with a side chain that is significantly different in charge and/or structure 
from that of the native amino acid. This type of substitution would be expected to affect 
the structure of the polypeptide backbone and/or the charge or hydrophobicity of the 
molecule in the area of the substitution. 

Moderate changes in the activity of the GES molecules of the present invention 
would be expected by substituting an amino acid with a side chain that is similar in 
charge and/or structure to that of the native molecule. This type of substitution, referred 
to as a conservative substitution, would not be expected to substantially alter either the 
structure of the polypeptide backbone or the charge or hydrophobicity of the molecule in 

the area of the substitution. 

Insertional GES variants are those with one or more amino acids inserted 
immediately adjacent to an amino acid at a particular position in the amino acid sequence 
of GES set forth in SEQ ID NO:2. Immediately adjacent to an amino acid means 
connected to either the a-carboxy or a-amino functional group of the amino acid. The 
insertion may be one or more amino acids. Ordinarily, the insertion will consist of one or 
two conservative amino acids. Amino acids similar in charge and/or structure to the 
amino acids adjacent to the site of insertion are defined as conservative. Alternatively, 
this invention includes insertion of an amino acid with a charge and/or structure that is 
substantially different from the amino acids adjacent to the site of insertion. 

Deletional variants are those where one or more amino acids in the amino acid 
sequence of GES set forth in SEQ ID N0:2 have been removed. Ordinarily, deletional 
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variants will have one or two amino acids deleted in a particular region of the GES 
molecule. 

The term biological activity", biologically active", "activity" and "active" refer 
to the ability of the monoterpene synthase to convert geranyl diphsphate (GDP) to a 
5 group of monoterpenes, of which geraniol is the principle and characteristic monoterpene 
synthesized by GES. 

The terms "DNA sequence encoding", "DNA encoding", "nucleic acid encoding" 
or "polynucleotide sequence encoding" refer to the order or sequence of 
deoxyribonucleotides along a strand of deoxyribonucleic acid. The order of these 
10 deoxyribonucleotides determines the order of amino acids along the translated 
polypeptide chain. The DNA sequence thus codes for the amino acid sequence. 

The term "hybridization", as used herein, refers to any process by which a strand of 
nucleic acid binds with a complementary strand through base pairing. 

The terms "stringent conditions" or "stringency", as used herein, refer to the 
15 conditions for hybridization as defined by the nucleic acid, salt, and temperature. These 
conditions are well known in the art and may be altered in order to identify or detect 
identical or related polynucleotide sequences. Numerous equivalent conditions 
comprising either low or high stringency depend on factors such as the length and nature 
of the sequence (DNA, RNA, base composition), nature of the target (DNA, RNA, base 
20 composition), milieu (in solution or immobilized on a solid substrate), concentration of 
salts and other components (e.g., formamide, dextran sulfate and/or polyethylene glycol), 

o o 

and temperature of the reactions (within a range from about 5 C to about 25 C below the 
melting temperature of the probe). One or more factors may be varied to generate 
conditions of either low or high stringency. 

25 The terms "replicable expression vector" and "expression vector" refer to a piece of 

DNA, usually double-stranded, which may have inserted into it a piece of foreign DNA. 
Foreign DNA is defined as heterologous DNA, which is DNA not naturally found in the 
host. The vector is used to transport the foreign or heterologous DNA into a suitable host 
cell. Once in the host cell, the vector can replicate independently of or coincidental with 

30 the host chromosomal DNA, and several copies of the vector and its inserted (foreign) 
DNA may be generated. In addition, the vector contains the necessary elements that 
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permit translating the foreign DNA into a polypeptide. Many molecules of the 
polypeptide encoded by the foreign DNA can thus be rapidly synthesized. 

The terms "transformed host cell," "transformed" and "transformation" refer to the 
introduction of DNA into a cell. The cell is termed a "host cell", and it may be a 
5 prokaryotic or a eukaryotic cell. Typical prokaryotic host cells include various strains of 
E. coll Typical eukaryotic host cells are plant cells, yeast cells, insect cells or animal 
cells. The introduced DNA is usually in the form of a vector containing an inserted piece 
of DNA. The introduced DNA sequence may be from the same species as the host cell or 
from a different species from the host cell, or it may be a hybrid DNA sequence, 
10 containing some foreign DNA and some DNA derived from the host species. 

Novel monotemene synthase 

According to one aspect, the present invention relates to polynucleotides encoding 
for monoterpene synthase, specifically GES. 

15 As described herein below, the present invention shows that the only product of 

geraniol synthase is geraniol, a reaction mechanism which is similar to the reaction 
mechanism of other terpene synthases. Therefore, a large EST database constructed from 
the peltate glands of three basil varieties, including Sweet Dani, for potential GES 
cDNAs, were examined. Two of these cultivars, EMX1 and SW, do not produce geraniol 

20 (or citral) but they do produce other monoterpenes, such as 1,8-cineole, linalool, and 
fenchone. BLAST searches identified 5 different types of cDNA sequences in Sweet 
Dani with sequence homology to known terpene synthases, but only one type of 
sequence was significantly unique to Sweet Dani, encoding a protein that was highly 
divergent from any terpene synthase-like sequence found in the other cultivars (less than 

25 35% identity), whereas the other four cDNAs encoded proteins that were >90% identical 
to proteins from the SW and EMXI varieties. A complete cDNA of this sequence, 
obtained by 5' RACE followed by RT-PCR, contains an open reading frame of 1701 
nucleotides that encodes a protein of 567 amino acids (Fig. 1). 

According to one embodiment, the present invention provides an isolated 

30 polynucleotide comprising a genomic, complementary or composite polynucleotide 
sequence encoding a GES, said GES being capable of converting GDP to geraniol. 
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According to one embodiment, the present invention provides an isolated 
polynucleotide comprising a nucleic acid sequence selected from the group consisting of: 

(a) the nucleic acid sequence of SEQ ID NO:l; 

(b) the complement of SEQ ID NO: 1 ; 

5 (c) a nucleic acid sequence which is at least 90% homologous to SEQ ID NO: 1; and 

(d) a nucleic acid sequence capable of hybridizing either (a) or (b). 
According to one another embodiment, the present invention provides a 
polynucleotide comprising a nucleic acid sequence encoding a GES comprising the 
amino acid sequence of SEQ ID NO:2. 

10 According to one embodiment, the present invention provides an isolated 

polynucleotide comprising a nucleic acid sequence encoding a GES which is at least 
60%, preferably at least 70%, more preferably at least 80% or more, most preferably at 
least 90% or 100% homologous (similar + identical amino acids) to the amino acid 
sequence set forth in SEQ ID NO: 2. 

15 According to another embodiment, the present invention provides a polynucleotide 

encoding a GES comprising the amino acid sequence of SEQ ID NO: 2, or fragments, 
derivatives and analogs thereof. The present invention also provides a polynucleotide 
which hybridizes under stringent conditions to the polynucleotide encoding GES 
comprising the amino acid sequence of SEQ ID NO:2 or fragments, derivatives and 

20 analogs thereof. The present invention further provides a polynucleotide which is 
complementary to the polynucleotide encoding a GES comprising the amino acid 
sequence of SEQ ID NO:2 or fragments, derivatives and analogs thereof. 

The isolation of cDNA encoding GES permits the development of efficient 
expression systems for this functional enzyme; provides useful tools for examining the 

25 . developmental regulation of geraniol biosynthesis; permits investigation of the reaction 
mechanism(s) of this unique enzyme and permits the transformation of a wide range of 
organisms in order to introduce geraniol biosynthesis de novo, or to modify endogenous 
geraniol biosynthesis. 

According to another aspect, the present invention relates to polypeptides having 

30 monoterpene synthase activity, specifically GES activity, i.e., said polypeptides convert 
GDP to monoterpene, specifically geraniol. 
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From the complete deduced amino acid sequence of the GES cDNA clone (SEQ ID 
NO:2, Fig. 2), GES shares the highest sequence similarity to monoterpene synthases 
from Lamiaceae species (Wise et al 1998; Colby el al 1993). 

A phylogenetic analysis of the deduced amino-acid sequence of GES, suggests that 
GES occupies a highly divergent branch of the terpene synthase family, but it most likely 
shares a most recent common origin with the Snapdragon ocimene and Myrcene 
synthases and to C. breweri linalool synthase, all terpene synthases catalyzing the 
formation of acyclic monoterpenes (Figure 3) (ClustalX and Treeview software, 
Thompson, 1997 and Page, 1996, respectively). 

According to one another embodiment, the present invention provides a 
polypeptide having GES activity, said activity being characterized by converting geranyl 
diphosphate (GDP) to geraniol. 

According to one embodiment, the present invention provides a GES having the 
amino acid sequence of SEQ ID NO:2. 

According to another embodiment, the present invention provides a polypeptide 
which is at least 60%, preferably at least 70%, more preferably at least 80% or more, 
most preferably at least 90% or 100% homologous (similar + identical amino acids) to 
the amino acid sequence set forth in SEQ ID NO:2, said polypeptide is a GES. 

In addition to the native GES amino acid sequence, sequence variants produced by 
deletions, substitutions, mutations and/or insertions are intended to be within the scope 
of the invention. The GES amino acid sequence variants of this invention may be 
constructed by mutating the DNA sequence that encodes the wild-type synthase, such as 
by using techniques commonly referred to as site-directed mutagenesis. Nucleic acid 
molecules encoding the GES of the present invention can be mutated by a variety of PCR 
techniques well known to one of ordinary skill in the art. See, e.g., 'PCR Strategies", M. 
A. Innis, D. H. Gelfand and J. J. Sninsky, eds., 1995, Academic Press, San Diego, Calif. 
(Chapter 14); "PCR Protocols: A Guide to Methods and Applications", M. A. Innis, D. 

* 

H. Gelfand, J. J. Sninsky and T. J. White, eds., Academic Press, NY (1990). 

By way of non-limiting example, the two-primer system utilized in the Transformer 
Site-Directed Mutagenesis kit from Clontech, may be employed for introducing site- 
directed mutants into the GES gene of the present invention. Following denaturation of 
the target plasmid in this system, two primers are simultaneously annealed to the 
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plasmid; one of these primers contains the desired site-directed mutation, the other 
contains a mutation at another point in the plasmid resulting in elimination of a 
restriction site. Second strand synthesis is then carried out, tightly linking these two 
mutations, and the resulting plasmids are transformed into a mutS strain of E. coli. 
5 Plasmid DNA is isolated from the transformed bacteria, restricted with the relevant 

restriction enzyme (thereby linearizing the unmutated plasmids), and then retransformed 
into E. colt This system allows for generation of mutations directly in an expression 
plasmid, without the necessity of subjoining or generation of single-stranded phagemids. 
The tight linkage of the two mutations and the subsequent linearization of unmutated 

10 plasmids result in high mutation efficiency and allow minimal screening. Following 
synthesis of the initial restriction site primer, this method requires the use of only one 
new primer type per mutation site. Rather than prepare each positional mutant separately, 
a set of "designed degenerate" oligonucleotide primers can be synthesized in order to 
introduce all of the desired mutations at a given site simultaneously. Transformants can 

15 be screened by sequencing the plasmid DNA through the mutagenized region to identify 
and sort mutant clones. Each mutant DNA can then be restricted and analyzed to confirm 
that no other alterations in the sequence have occurred (e.g., by band shift comparison to 
the unmutagenized control). 

In the design of a particular site directed mutagenesis, it is generally desirable to 

20 first make a non-conservative substitution (e.g., Ala for Cys, His or Glu) and determining 
if activity is greatly impaired as a consequence. The properties of the mutagenized 
protein are then examined with particular attention to the kinetic parameters of K m and 
kcat as sensitive indicators of altered function, from which changes in binding and/or 
catalysis per site may be deduced by comparison to the native enzyme. If the residue is 

25 demonstrated to be important by activity impairment, or knockout, then conservative 

substitutions can be made, such as Asp for Glu to alter side chain length, Ser for Cys, or ( 
Arg for His. For hydrophobic segments, it is commonly size that is usefully altered, 
although aromatics can also be substituted for alkyl side chains. Changes in the normal 
product distribution can indicate which step(s) of the reaction sequence have been altered 

30 by the mutation. Modification of the hydrophobic pocket can be employed to change 
binding conformations for substrates. 
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Other site directed mutagenesis techniques might also be employed with the 
nucleotide sequences of the invention. For example, restriction endonuclease digestion of 
DNA followed by ligation may be used to generate deletion variants of GES, as 
described in section 15.3 of Sambrook et al. (Molecular Cloning: A Laboratory Manual, 
5 2nd Ed., Cold Spring Harbor Laboratory Press, New York, N. Y. 1989). A similar 
strategy may be used to construct insertion variants, as described in section 15.3 of 
Sambrook et al., supra. 

Oligonucleotide-directed mutagenesis may also be employed for preparing 
substitution variants of this invention. It may also be used to conveniently prepare the 

10 deletion and insertion variants of this invention. This technique is well known in the art 
as described, for example, by Adelman et al. (DNA 2:183 1983); Sambrook et al., supra; 
"Current Protocols in Molecular Biology", 1991, Wiley (NY), F. T. Ausubel, R. Brent, 
R. E. Kingston, D. D. Moore, J. D. Seidman, J. A. Smith and K. Struhl, eds. 

Generally, oligonucleotides of at least 25 nucleotides in length are used to insert, 

15 delete or substitute two or more nucleotides in the nucleic acid molecules encoding GES 
of the invention. An optimal oligonucleotide will have 12 to 15 perfectly matched 
nucleotides on either side of the nucleotides coding for the mutation. To mutagenize 
nucleic acids encoding the native GES of the invention, the oligonucleotide is annealed 
to the single-stranded DNA template molecule under suitable hybridization conditions. A 

20 DNA polymerizing enzyme, usually the Klenow fragment of E. coli DNA polymerase I, 
is then added. This enzyme uses the oligonucleotide as a primer to complete the 
synthesis of the mutation-bearing strand of DNA. Thus, a heteroduplex molecule is 
formed such that one strand of DNA encodes the native synthase inserted in the vector, 
and the second strand of DNA encodes the mutated form of the synthase inserted into the 

25 same vector. This heteroduplex molecule is then transformed into a suitable host cell. 

Mutants substituted with more than one amino acid may be generated in one of 
several ways. If the amino acids are located close together in the polypeptide chain, they 
may be mutated simultaneously using one oligonucleotide that codes for all of the 
desired amino acid substitutions. If however, the amino acids are located in some 

30 distance from each other (e.g., separated by more than ten amino acids) it is more 
difficult to generate a single oligonucleotide that encodes all of the desired changes. 
Instead, one of two alternative methods may be employed. In the first method, a separate 

17 
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oligonucleotide is generated for each substituted amino acid. The oligonucleotides are 
teiTannealed to the single-stranded template DNA simultaneously, and the second DNA 
strand synthesized from the template will encode all of the desired amino acid 
substitutions. An alternative method involves two or more rounds of mutagenesis to 

5 produce the desired mutant The first round is as described for the single mutants: native 
GES DNA is used for the template, an oligonucleotide encoding the first desired amino 
acid substitution is annealed to this template, and the heteroduplex DNA molecule is 
then generated. The second round of mutagenesis utilizes the mutated DNA produced in 
the first round of mutagenesis as the template. Thus, this template already contains one 

10 or more mutations. The oligonucleotide encoding the additional desired amino acid 
substitution(s) is then annealed to this template, and the resulting strand of DNA now 
encodes mutations from both the first and the second rounds of mutagenesis. The 
mutagenized DNA can then be used as a template in a third round of mutagenesis, and so 

on. 

According to one currently preferred embodiment, the GES of the present 
invention is originated from basil species, specifically from "Sweet Dani" cultivar. 

Sweet basil (Ocimum basilicum) such as "Sweet Dani" is particularly rich in citral. 
Over 99% of the monoterpenes present in "Sweet Dani" comprises of geraniol, nerol, 
geranial and neral. The accumulation of these monoterpenes during the maturation of the 
20 leaves was analyzed. Younger leaves had the highest levels of monoterpenes and the 
levels got progressively lower as the leaves mature and expand (table 1). 
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Table I. Levels of geranial, neral, geraniol and nerd in leaves of different age in the 
basil "Sweet Dani" cultivar. 





Geranial 


Neral 


Geraniol 


Nerol 


Small leaves 


1.80±0.31 a 


0.92 ± 0.20 


0.14 ±0.08 


0.05 ± 0.03 


Medium Leaves 


1.17 ±0.12 


0.57 ± 0.02 


0.4 ± 0.02 


0.02 ± 0.01 


Large Leaves 


0.83 ± 0.42 


0.38 ±0.13 


0.01 ± o.oo 


0.01 ± 0.00 



3 mg/g fresh leaves ± S JO. 

According to another aspect, the present invention provides methods for the 
30 production, isolation and purification of the GES according to the present invention, as 
well as of the products of its enzymatic activity. 
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t\ gene cuouumg may oe incorporated into any organism capable of 
synthesizing terpenes, or cell culture derived thereof. 

The geraniol-encoding gene may be incorporated into the organism for a variety of 
purposes, including but not limited to production of GES; production of geraniol; 
5 production of products downstream to geraniol; and production or modification of flavor 
and aroma compounds. 

According to one embodiment, the present invention provides an expression vector 
comprising a nucleic acid sequence encoding a GES. 

According to another embodiment, the present invention provides an expression 
10 vector comprising a nucleic acid sequence selected from the group consisting of: 

(e) the nucleic acid sequence of SEQ ID NO:l; 

(f) the complement of SEQ ID NO: 1; 

(g) a nucleic acid sequence which is at least 90% homologous to SEQ ID NO:l; 
and 

15 (h) a nucleic acid sequence capable of hybridizing either (a) or (b). 

According to yet another embodiment, the present invention provides an expression 
vector comprising a polynucleotide sequence encoding a GES having SEQ ID NO:2. 

According to further embodiment, the present invention provides an expression 
vector comprising a polynucleotide sequence encoding a GES which is at least 60%, 

20 preferably 70%, more preferably 80% or more, most preferably at least 90% or 100% 
homologous to SEQ ID NO:2. 

Vectors of various types may be used in the practice of the present invention. A 
specific vector type is used according to the host cell in which expression is desired, as is 
known to a person with ordinary skill in the art, and as described herein below. The 

25 vector usually has a replication site, marker genes that provide phenotypic selection in 
transformed cells, one or more promoters, and a polylinker region containing several 
restriction sites for insertion of foreign DNA. For example, plasmids typically used for 
transformation of E. coli include pBR322, pUC 18, pUC 19, pUCIlS, pUCl 19, and 
Bluescript M13, all of which are described in sections 1.12-1.20 of Sambrook et ah, 

30 supra. These vectors contain genes coding for ampicillin and/or tetracycline resistance, 
which enables cells transformed with these vectors to grow in the presence of these 
antibiotics. However, many other suitable vectors, harboring different genes encoding for 
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selection markers are available as well. The construction of suitable vectors containing 
DNA encoding replication sequences, regulatory sequences, phenotypic selection genes 
and the GES DNA of interest are prepared using standard recombinant DNA procedures. 
Isolated plasmids and DNA fragments are cleaved, tailored, and ligated together in a 
specific order to generate the desired vectors, as is well known in the art (see, for 
example, Sambrook et al., supra). 

According to yet another embodiment the present invention provides a host cell 
containing the expression vector of the present invention. The present invention further 
provides a method for the production of recombinant GES, the method comprising a) 
culturing the host cell containing the expression vector comprising at least a fragment of 
the polynucleotide sequence encoding GES under conditions suitable for the expression 
of the GES ; and b) recovering GES from the host cell culture. 

The host cell may be transformed with the expression vector according to the 
present invention by using any known method for the insertion of foreign nucleic acid 
sequences into a prokaryotic or eukaryotic host cell. The transformation process results 
in the expression of the inserted DNA such as to change the recipient cell into a 
transformed, genetically modified or transgenic cell. 

Potential cDNA encoding GES were examined by RT-PCR and cloning of full 
length cDNAs into the pCRT7/CT-TOPO TA vector (Invitrogen, Carlsbad, CA). These 
constructs were expressed in the E. coli expression system, and the resulted proteins 
(64.9 kDa) were tested for GES activity. Since the monoterpenes synthases N-teiminus 
serves as a transit peptide, which is cleaved after the protein is inserted into the organelle 
(Bohlmami at al 1998). Constructs encoding truncated GES proteins missing the first 34 
or 43 amino acids (the exact location of the cleavage site has not yet been determined 
with certainty) were also constructed in either pCRT7/CT-TOPO TA vector or a pET- 
11a vector. These constructs were expressed in the E. coli expression system, and the 
resulted proteins (61.2 and 60.2 kDa, respectively) were tested for GES activity. 
Conditions were worked out to obtain substantial amounts of active gene products. 

Recombinant GES proteins were harvested from E. coli cells (with the constructs), 
and were purified using several chromatographic steps (table 2). The purified proteins 
supplemented with Mn 2+ were found to convert radiolabeled GDP into geraniol (SPME- 
Gas chromatogram) as exemplified herein below (Figure 6C). The present invention thus 
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discloses the conversion of radiolabeled GDP into a putative monoterpene geraniol by 
the activity of GES gene product. 

Table 2: Purification of GES from gland of basil "Sweet Dani" cultivar 



5 





Total 
activity 
(ptat) 


Protein 
(nig) 


Specific 
activity 
(pkat/mg) 


Purification 


Yield (%) 


Crude 


3874.9 


23.77 


163.0 


1 


100 


DE-53 


1225.7 


2.71 


153.0 


2.8 


31.7 


Mono Q 


265.5 


0.43 


617.7 


3.8 


6.9 


Superosel2 


337.2 


0.05 


6244.2 


38.3 


8.7 



According to a further embodiment the present invention provides a method for 
producing significant amounts of geraniol, the method comprising a) culturing the host 
cell containing an expression vector comprising at least a fragment of the polynucleotide 
10 sequence encoding GES under conditions suitable for the expression and activity of the 
GES; and b) recovering geraniol from the host cell culture. 

According to yet another embodiment the present invention provides a method for 
producing significant amounts of geraniol metabolites downstream in the terpene 
biosynthesis pathway. According to one preferred embodiment the downstream geraniol 
15 metabolites are geranial and neral. 

The mixture of geraniol with his oxidation products, geranial and neral (also called 
citral), imparts a "lemon" flavor, and lemongrass, ginger, and some varieties of sweet 
basil such as "Sweet Dani" are particularly rich in citral (together with neral they 
constitute >70% of total essential oil terpenes) (fig 4A). Therefore, geraniol and his 
20 metabolites are of interest to the food and cosmetic industry. 

According to yet another aspect the present invention provides a prokaryotic or 
eukaryotic organism in which significant amounts of geraniol are synthesized. 

According to one embodiment, the present invention provides a prokaryotic 
organism comprising a polynucleotide sequence encoding a GES stably integrated into 
25 its genome. 

According to one embodiment, the present invention provides a prokaryotic 
organism comprising a nucleic acid sequence selected from the group consisting of SEQ 

* 
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ID NO:l, the complement of SEQ ID NO: 1, a nucleic acid sequence which is at least 
90% homologous to SEQ ID NO:l, a nucleic acid sequence capable of hybridizing to 
SEQ ID NO: 1 and a nucleic acid sequence capable of hybridizing to the complement of 
SEQ ID NO: 1, stably integrated into its genome. According to another embodiment, the 
5 present invention provides a prokaryotic organism comprising a polynucleotide sequence 
encoding a GES having the amino acid sequence of SEQ ID NO: 2 stably integrated into 
its genome. 

According to yet another embodiment the present invention provides a prokaryotic 
organism comprising a polynucleotide sequence encoding a GES comprising amino acid 
10 sequence which is at least 60%, preferably at least 70%, more preferably at least 80% or 
more, most preferably at least 90% or 100% homologous to SEQ ID N0:2 stably 
integrated into its genome. 

According to further embodiment, the present invention provides a prokaryotic 
organism comprising a polynucleotide encoding GES stably integrated into its genome, 
15 said prokaryotic organism produces geraniol. 

According to one preferred embodiment, the prokaryotic organism is selected from 
the group consisting of various bacteria. Preferably, the prokaryotic organism is selected 
from E. coli strains. 

As is known to a person skilled in the art, many bacterial strains are suitable as host 
20 cells for the over-expression of monoterpenes according to the present invention, 

including E. coli strains and many other species and genera of prokaryotes including 
bacilli such as Bacillus subtilis, other enterobacteriaceae such as Salmonella 
typhimurium or Serratia marcesans, and various Pseudomonas species. Prokaryotic host 
cells or other host cells with rigid cell walls are preferably transformed using the calcium 
25 chloride method as described in section 1.82 of Sambrook et aL, supra. Alternatively, 
electroporation may be used for transformation of these cells. Prokaryote transformation 
techniques are known in the art, e.g. Dower, W. J., in Genetic Engineering, Principles 
and Methods, 12:275-296, Plenum Publishing Corp., 1990; Hanahan et al., Meth. 
Enzymol., 204:63 1991. 
30 Although geraniol was reported to be emitted from flowers of many species, 

notably roses (Rao et al 2000; Antonelli et at 1997 r ; Bayrak et al 1994), and from 
vegetative tissues of many herbs (Mockute et at 1999; Charles et al 1992; Vieira et at 
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2001; Mallavarapu et al 1 998), the present invention discloses for the first time a gene 
encoding GES. Accordingly, to the best of our knowledge, there are no other known 
genes encoding GES from any other Genus. 

According to yet another aspect, the present invention provides geraniol and 
5 geraniol metabolite produced downstream in the monoterpene biosynthesis pathway 
obtained by the methods of the present invention for industrial uses. 

According to one embodiment, the present invention provides geraniol and 
geraniol metabolites obtained by the methods of the present invention for use in a 
product selected from the agricultural, cosmetic, and food products. 
10 According to another embodiment, the products downstream to geraniol are 

geranial and neral. 

The principles of the invention, disclosing novel monoterpene synthase, 
polynucleotides encoding same, methods of productions and methods for use may be 
better understood with reference to the following non-limiting examples. 

15 

EXAMPLES 

Plant material 

Seeds for "Sweet Dani" were obtained from a local nursery. They were sown in 
Horticultural vermiculite and put in the growth chamber until germination. One week 
20 after germination, each seedling was transplanted into 500 ml pot containing Sunshine 
Mix no. 1 polling soil and grown in a greenhouse under constant illumination. 

Volatile oil extraction from leaves and glands 

Basil leaves were categorized into three sizes, small (0.5-1.5 cm), medium (1.5-3 

25 cm), and large (3-4 cm). Two hundred milligram of each leaves were added to liquid N2 
and ground by mortar and pestle. The powder was soaked in 2 ml MTBE (methyl t-butyl 
ether) containing 0.1 mg linalool as internal standard (linalool was used because the 
"Sweet Dani" cultivar does not contain linalool) and extracted for 2 hours at room 
temperature in 5 ml glass vials with tightly sealed rubber septa caps. The MTBE upper 

30 layer, which included the volatile oil, was removed and placed into another vial and 

concentrated to 200 |il under gentle N2 gas flow for GC-MS analysis. Data points were 
obtained in triplicates. 
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Volatile oils were extracted from the glands of young leaves with a stretched glass 
pipette (Gang et al 2001). 

GC-MS analysis of plant volatiles 
5 A Shimadzu QP-5000 system (Shimadzu, Columbia, MD) equipped wrdi 

Shimadzu GC-17 gas chromatograph was used for GC-MS analysis of volatile 
compounds. Separation was performed on DB-WAX (30 m x 0.32 mm i.d. x 0.25 am 
film thickness, J&W Scientific, Folsom, CA) capillary column with electron impact 
mode (1.4 kV). However, some nonpolar compounds eluted with the solvent peak in the 

10 DB-WAX column, and their separation was achieved on CP-5 column (30m x 0.32mm 
i.d. x 1 \im film thickness, AUtech Associates Inc., Deerfield, DL). The oven 
temperature for DB-WAX methods was held at 60% for 2 mm and raised to 220°C at 
4°C/min with the injector set at 220°C and the interface set at 240°C. The GC condition 
for CP-5 method was same with previous report (Gang et al 2001). Ultrapure helium 

15 was used as the carrier gas at a rate of 1.3 ml/min. Samples (2 jllI) were injected by the 
Shimadzu AOC-17 Autoinjector. Eluted compounds were identified by comparing their 
retention time and mass fragmentation patterns with standard compounds. 

Example 1 : GES enzyme assays 

20 GES activity was assayed by incubating 5 jil of the enzyme sample in a final 

volume of 50 jal buffer containing 50mM Hepes-KOH, pH 8.0, ImM DTE, 0.5 mM 
MnCI2, 20mM MgC12, 10% glycerol and 54 \xM [1-3HJ-GDP (specific activity 20 
Ci/mol, American Radiolabeled Chemicals, St Louis, MO). After incubation for 30 mm 
at 320C, 160 jal hexane was added to the tube, and vortexed briefly and centrifuged to 

25 separated the phases. The hexane layer was directly placed into a scintillation vial 
containing 2 ml of nonaqueous scintillation fluid (Econo-Safe, Research Products 
International, Mount Prospect EL). This extraction procedure was repeated twice and the 
total hexane phase was counted by Liquid Scintillation counter (LS-6500 model, 
Beckman Coulter, Fullerton, CA). Boiled enzyme extracts were used as controls. 

30 
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Identification of Enzymatic Products 

GES enzyme assays were also performed by adding 100 of enzyme solution 
with 900 \i\ assay solution containing 54 \xM non-radioactive GDP (Echelon Research 
Laboratories, Salt Lake City, UT) and the same buffer described above. The reactions 

5 were carried out in an 8 mL DuPont autosampler vial with a white solid-top 

polypropylene cap (Alltech, Deerfield, EL). After letting the reaction proceed for 2-4 
hr at 32°C, the liberated compounds were collected with a solid-phase microextraction 
(SPME) device PDMS-100 with a polydimethylsiloxane fiber (Supelco, Bekkefonte, PA) 
by inserting the fiber into the tube and leaving it in for 20 mim at 42°C. After this 

10 incubation step, the solid-phase microextraction fiber was directly injected into the GC- 
MS; 

Example 2: Terpene synthase assays with 18 Q-Iabeled water 

Assays in buffer containing 180-labeled water were carried out in 2 ml glass vials 

15 with screw cap of PTFE/Silicone Septa (Supelco, Bllefonte, PA) by the addition 20 fil of 
purified enzyme (app. 1 jig protein) to 180 (il assay solution that contained 20 |il lOx 
assay buffer with 150 \i\ H2180 (95% atom, Icon Service, Summit, NJ) and 10 |il of 5.4 
mM GDP. The final concentration of H2180 in this assay solution was 71.3%. This 
solution was incubated for 2 hr at 320C, and cooled down on ice, then extracted with 200 

20 jil pentane. After concentration to 50 jil, 2 ^1 of this solution was injected to GC-MS 
system. To compare the mass spectra pattern, a pentane extract of the product from a 
reaction in which normal water (H2160) was used was also analyzed. 

Phosphatase activity assay 

25 Phosphatase activity was measured (Harnandez et al 1996) with the following 

modification: Assay samples were prepared by incubating 50 ^1 of enzyme solution in a 
final volume of 400 \xl assay buffer containing 2mM p-nitrophenyl phosphate as 
substrate. The buffer composition was the same as with the GES assay (but without 
GDP). After incubation for 1 h at room temperature, the reaction was stopped by adding 

30 of 700 |il of 0.2 M Na2C03. The yellow color generated from the hydrolysis of p- 
nitrophenyl phosphate was measured at 420 nm in a spectrophotometer (Beckman 
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DU530, Fullerton, CA). Phosphatase activity was calculated using a standard curve for 
p-nitrophenol. For the purified enzyme, this assay was scaled down ten-fold. 

F.Yam ple 3: GES purification 

All purification steps were carried out at 40C unless stated otherwise. Glands were 
isolated from approx. 300 g of Sweet Dani basil (Gang et al 2001), with a total yield of 4 
ml of resuspended glands. The gland preparation was diluted 10:1 (V/V) in ice-cold 
enzyme extraction buffer (lOOmM BisTris-HCI, pH 7.5, lOmM DTE, 5 mM Na2S204, 
2% (w/v) PVPP, 10% glycerol), and sonicated on ice, with rest intervals for cooling 
down, until gland cells were completely lysed. After centrifugation for 20 mm at 
10,000g, the supernatant (39 ml) was loaded onto a DEAE-cellulose column (10 mL of 
DE53, Whatman, Fairfield, NJ) installed in a Pharmacia FPLC apparatus and pre- 
equilibrated with a solution containing 50mM Tris-HCI, pH 7.5, 10% glycerol, and 10 
mM P-mercaptoethanol (buffer A). After elution of unbound material from the column 
with 25 ml of buffer A, GES activity was eluted with 200 ml of a linear gradient from 0 
to 1 M KCI in buffer A. The flow rate was 1.0 ml/min, and 3 ml fractions were collected 
and then assayed for GES activity. The fractions with the highest GES activity were 
pooled (KCI concentration of 255-435 mM, a total of 38 ml), and dialyzed in buffer A 
for 4 hr to remove KCI. This dialysis step did not result in any decrease in GES activity. 
The enzyme solution was subsequendy loaded onto strong anion exchange column 
(Mono Q, 0.5 cm x 6,0 cm, Pharmacia Biotech, Piscataway, NJ) pre-equilibrated with 

■ + - 

buffer A. After washing off the unbound material with 2 ml of buffer A, GES was eluted 
with 50 mL of a linear gradient 0-700 mM KCI in buffer A at 0.5 ml/min, and 1 ml 
fractions were collected. The highest GES activity was detected in the 2 ml fraction 
containing 294 mM KCI. To this fraction was added octyl glucoside (final concentration 
of 5 mM), and the enzyme was concentrated in an Ultrafree-4 centrifugal device 
(Millipore, Bedford, MA) to a total volume of 200 ul. The concentrated enzyme solution 
was loaded onto a size exclusion column (10mm x 300 mm) packed with Superose 12 
(Pharmacia) and active fraction was isocratically eluted with 100 mM KCI in Buffer A at 
0.2 ml/min. Fractions (0.5 ml each) were collected and protein purity was examined by 
SDSPAGE gel electrophoresis followed by Coomassie Brilliant Blue or silver staining of 
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the gel. The protein concentrations were measured by the Bradford method or by staining 
intensity on SDS-PAGE compared with BSA concentration standards. 

Molecular mass estimation 

Partially purified GES was run on a size exclusion column under the same 
conditions used during the purification procedure, except that 0.25 ml fractionations 
were collected instead, and fractions were assayed for GES activity. A standard curve 
was obtained by plotting the Ve/VO of the standard proteins against the log of the 
molecular weight. The protein standards used included cytochrome C (12.4 kDa), 
carbonic anhydrase (29 kDa), ovalbumin (45 kDa), bovine serum albumin (66 kDa), 
alcohol dehydrogenase (from equine liver, 80 kDa), and alcohol dehydrogenase (from 
yeast, 141 kDa). The subunit molecular weight was estimated by SDS-PAGE performed 
on 10% polyacrylamide gel and calibrated with molecular weight standard in the range 
of 14-212 kDa (New England BioLabs, MA). 

Example 4: Characterization of GES properties 

The pH optimum for GES activity was determined using three buffer systems. 
Reaction was carried out in 50 mM Bis-Tris buffer ranging from pH 6.0 to 7.0, 50 mM 
Tris-HCI buffer ranging pH 7.0 to 9.0, and 50 mM glycine-NaOH buffer ranging from 
pH 9.0 to 10.0. Temperature stability of GES was determined by incubating GES in 
temperatures ranging from 4 to 650C for 30 mm and then chilling the samples on ice, 
followed by enzyme assays at 320C. To determine the kinetic parameters of GES, the 
enzyme was diluted to the appropriate concentration and incubation time was set for 30 
mm at 320C. In determining the Km value for GDP. Mn2+ concentration was set at a 
saturated level and GDP concentration was changed from 0.5 pM to 108 \iM with 10 
different data points. The Km value for Mn2+ was measured at saturated GDP levels, . 
and Mn2± concentration was changed from 4 \iM to 1000 jjM with 9 data points. 
lineweaver-Burk plots were made to obtain the Km value. 

Example 5: BSI-MS/MS analysis of purified GES 

Mass spectrometric analysis of the purified GES was carried out in the Proteomics 
Core Facility of the Southwest Environmental Health Sciences Center and Arizona 
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Cancer Center, at The University of Arizona. Proteins were first separated by SDS- 
PAGE, as described above, stained lightly with Coomassie brilliant blue R250, excised 
from the gel and digested with trypsin (Shevchenko et al 1996). Extracted peptides were 
analyzed by liquid chromatography-tandem mass spectrometry (LC -MS/MS) using a 
5 ThermoFinnigan LCQ Classic quadrupole ion trap mass spectrometer (San Jose, CA) 

equipped with a Michrom MAGIC2002 HPLC (Auburn, CA) and a nanospray ion source 
(University of Washington). Peptides were loaded onto 10-cm capillaries (365 \im O.D. 
x 100 pm ID.; packed with 5-6cm of Vydac C18 material) that were pulled to 3-5 urn 
tips using a Suffer Instruments P2000 capillary puller (Novato, CA). Peptides were 

10 eluted at a flow rate of 200-300 nL-min-1 into the mass spectrometer using reversed 

phase solvent conditions (Shevchenko et al 1996). Tandem MS spectra of peptides were 
analyzed with the TurboSequest program to assign peptide sequences to the spectra (Eng 
et al 1994). TurboSequest analyses were performed against the sweet basil EST 
databases housed at the Arizona Genomics Institute at the University of Arizona. The 

15 non-identified spectra were further analyzed by ExPASy peptide mass program 

(http://us.expasy.orgltools/peptide-mass.html) for the calculated tryptic peptide masses 
from fulllength GES cDNA. 

Example 6: Isolation of GES cDNA5 and Expression in E. coli 
20 A basil cv. Sweet Dani peltate gland EST database containing 3,200 unique 

sequences was developed at the Arizona Genomics Institute and the Arizona Genomics 
Computational Laboratory at the University of Arizona, using a cDNA library 
constructed from gland mRNA5 (Gang et al 2001). BLAST searches revealed numerous 
ESTs with sequence similarity to terpene synthases. Potential cDNA5 encoding GES 
25 were examined by RT-PCR cloning of full length cDNA5 into the pCRT7/CT-TOPO TA 
vector (Invitrogen, Carlsbad, CA), expressing these constructs in the E. coli expression 
system, and testing the resulting proteins for activity with GDP (described in Chen et al 
2003). Constructs encoding truncated GES proteins missing the first 34 or 43 amino 
acids were also constructed (using the method described in Chen et al 2003) in either 
30 pCRT7/CT-TOPO TA vector or a pET-1 la vector. After harvesting recombinant GES 
proteins from E. coli cells, the proteins were purified using the same method employed 
in the purification of GES from basil glands. 
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Example 7: Sequence Analaysis 

Alignment of multiple protein sequences was performed using the ClustalX 
program (Thompson et at 1997). Sequence relatedness by the neighbor-joining method 
was determined using the protocol included in the ClustalX package. The phylogenic tree 
5 was drawn using the TREE VIEW program 

hftp://taxonomy.zoolOgy.gla.ac.uk/rOd/treeview.html (Page a all 996)), 
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